Online fuzzy medoid based clustering algorithms

نویسنده

  • Nicolas Labroche
چکیده

This paper describes two new online fuzzy clustering algorithms based on medoids. These algorithms have been developed to deal with either very large datasets that do not fit in main memory or data streams in which data are produced continuously. The innovative aspect of our approach is the combination of fuzzy methods, which are well adapted to outliers and overlapping clusters, with medoids and the introduction of a decay mechanism to adapt more effectively to changes over time in the data streams. The use of medoids instead of means allows to deal with non numerical data (e.g. sequences . . . ) and improves the interpretability of the cluster centers. Experiments conducted on artificial and real datasets show that our new algorithms are competitive with state-ofthe-art clustering algorithms in terms of purity of the partition, F1 score and computation times. Finally, experiments conducted on artificial data streams show the benefit of our decay mechanism in the case of evolving distributions.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Clustering Moving Objects via Medoid Clusterings

Modern geographic information systems do not only have to handle static information but also dynamically moving objects. Clustering algorithms for these moving objects provide new and helpful information, e.g. jam detection is possible by means of these algorithms. One of the main problems of these clustering algorithms is that only uncertain positional information of the moving objects is avai...

متن کامل

Low-complexity fuzzy relational clustering algorithms for Web mining

This paper presents new algorithms (Fuzzy c-Medoids or FCMdd and Robust Fuzzy c-Medoids or RFCMdd) for fuzzy clustering of relational data. The objective functions are based on selecting c representative objects (medoids) from the data set in such a way that the total fuzzy dissimilarity within each cluster is minimized. A comparison of FCMdd with the well-known Relational Fuzzy c-Means algorit...

متن کامل

Multiple Medoids based Multi-view Relational Fuzzy Clustering with Minimax Optimization

Multi-view data becomes prevalent nowadays because more and more data can be collected from various sources. Each data set may be described by different set of features, hence forms a multi-view data set or multi-view data in short. To find the underlying pattern embedded in an unlabelled multiview data, many multi-view clustering approaches have been proposed. Fuzzy clustering in which a data ...

متن کامل

Document Categorization with MAJORCLUST

This paper investigates the text categorization capabilities of two special clustering algorithms: Fuzzy k-Medoid and MAJORCLUST. Aside from quantifying the categorization performance of the mentioned algorithms, our experimental setting will also help to answer special questions related to clustering problems such as cluster number determination or cluster quality evaluation.

متن کامل

Fuzzy relational clustering around medoids: A unified view

Medoid-based fuzzy clustering generates clusters of objects based on relational data, which records pairwise similarities or dissimilarities among objects. Compared with single-medoid based approaches, our recently proposed fuzzy clustering with multipleweighted medoids has shown superior performance in clustering via experimental study. In this paper, we present a new version of fuzzy relation...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Neurocomputing

دوره 126  شماره 

صفحات  -

تاریخ انتشار 2014